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HSP53G 20303 bp DNA linear PRI 25-JUN-1997 

Human p53 gene for transformation related protein p53 (also called 
transformation-associated protein p53, cellular tumor antigen p53, 
and non-viral tumour antigen p53) . 
X54156 

X54156.1 GI:35213 

anti -oncogene; cell cycle control; growth suppressor; heat shock 
protein 70; oncogene; p53 cellular tumour antigen; p53 gene; 
phosphoprotein; transforming capacity; tumor antigen. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Futreal, P. A. , Barrett, J. C. and Wiseman, R.W. 

An Alu polymorphism intragenic to the TP53 gene 

Nucleic Acids Res. 19 (24), 6977 (1991) 

92107726 

2 

Chumakov , P . M . , Almazov # V.P. and Jenkins, J. R. 

Unpublished 

3 (bases 1 to 20303) 

Chumakov , P . M . 

Direct Submission 

Submitted (02 -AUG-1990) Chumakov P.M., Engelhardt Inst, of 
Molecular Biology, Academy of Science of the USSR, Vavilov St. 32, 
117984 Moscow, USSR 

See also entries K03199, M14690, M14695, X01405, X02469, M22881-4, 
M22887-8, M22894-8 . 

See also Mol . Cell. Biol. 6:1379-1385(1986); 
and Mol. Cell. Biol. 7:961-963(1987). 

Location/Qualif iers 

1. .20303 

/organism="Homo sapiens" 

/ strain= "Caucasian" 

/ db_xre f = " t axon : 9 6 0 6 " 

/chromosome= " 17 " 

/map="pl3" 

843 . . 19876 

/gene="p53" 

join (843. .94 9,11689. .11790,11906. .11927,12 021. .122 99, 
13 055 . . 13238, 13 320 . . 13432 , 14 000 . . 14109, 14452 . .14588, 
17572. .17678,18599. .19876) 



14681. .14754 
/gene="p53" 

primtranscript 843.. 19876 
/gene= M p53" 

exon 843 . . 949 
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/gene="p53" 
/ number =1 
950 . . 11688 
/gene="p53" 
/number =1 
2581. .2587 
/gene="p53" 

/note="5 1 -ALU flanking" 
2588. .2877 
/gene="p53" 
/rpt_family="ALU" 
2890. .2896 
/gene="p53" 

/note="3' ALU-flanking" 
3915. .3929 
/gene="p53" 

/note="3 ■ ALU-flanking" 

3950. .4223 

/gene="p53" 

/ rp t_f ami ly = " ALU " 

4224 . .4238 

/gene="p53" 

/note="5 • -ALU flanking" 
4319. .4327 
/gene="p53" 

/note="5 * -ALU flanking" 
4328 . .4603 
/gene="p53" 
/rpt_family="ALU" 
4631. .4639 
/gene="p53" 

/note="3' ALU-flanking" 
4786. .5574 
/gene="p53 n 

/note=" rearranged cluster" 
/rpt_f amily= "ALU" 
5802 . .5811 
/gene="p53" 

/note="5 ' -ALU flanking" 

5812. .6100 

/gene="p53" 

/ rp t_f ami ly = "ALU " 

6127 . . 6136 

/gene="p53 " 

/note="3' ALU-flanking" 
6221. .6236 
/gene="p53 11 

/note= "5 » -ALU flanking" 
6237. .6517 
/gene="p53" 
/rpt_f amily= "ALU" 
6531. .6546 
/gene="p53" 

/note="3 ' ALU-flanking" 
6548. .7812 
/gene="p53" 

/note=" rearranged cluster" 
/rpt_f amily= "ALU" 
8703 . .8982 
/gene="p53" 
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/rpt_f amily= "ALU" 
repeat unit 9087.. 9098 

/gene="p53" 

/note="5 ' -ALU flanking" 
repeatunit 9099.. 9377 

/gene="p53" 

/rpt_family="ALU" 
repeat unit 9391.. 9402 

/gene= M p53" 

/note="3* ALU- flanking " 
repeat unit 9513.. 10332 

/gene="p53" 

/note=" rearranged cluster" 
/rpt_f amily="ALU" 
repeat unit 11065.. 11069 

/gene= n p53" 

/note="5 ' -ALU flanking" 
repeat unit 11070.. 11357 

/gene="p53" 

/rpt_f amily= "ALU" 
repeat unit 11374.. 11378 

/gene="p53" 

/note= f, 3* ALU-flanking" 
exon 11689.. 11790 

/gene= "p53 " 
/ number =2 

CDS join (11717. .11790,11906. .11927,12021. .122 99,13 055. .1323 8, 

1332 0. .1343 2, 14 000. .14109,14452. .14 5 88,14681. .14 754, 
17572. .17678,18599. .18680) 
/gene="p53" 
/codon_start=l 
/product= "protein p53" 
/protein_id= " CAA38095 . 1 " 
/db_xref="GI :35214" 

/ translation= "MEEPQSDPSVEPPLSQETFSDLWKLLPENNVLSPLPSQAMDDLM 
L S PDD I E QW FTED PG PDE AP RM P E AAPR VAP AP AAPT P AAP AP AP SWPLSSSVPS QKT 
YQGSYGFRLGFLHSGTAKSVTCTYSPALNKMFCQLAKTCPVQLWVDSTPPPGTRVRAM 
AIYKQSQHMTEWRRCPHHERCSDSDGLAPPQHLIRVEGNLRVEYLDDRNTFRHSWV 
PYEPPEVGSDCTTIHYNYMCNSSCMGGMNRRPILTIITLEDSSGNLLGRNSFEVRVCA 
C PGRDRRTEE ENLRKKGE PHHEL PPGS TKRAL PNNTS S S PQPKKKPLDGE YFTLQ I RG 
RERFEMFRELNEALELKDAQAGKEPGGSRAHSSHLKSKKGQSTSRHKKLMFKTEGPDS 
D" 

intron 11791.. 11905 

/gene= M p53" 

/number =2 
exon 11906 . . 11927 

/gene="p53" 

/number =3 
intron 11928.. 12020 

/gene="p53" 

/number =3 
exon 12021 . . 12299 

/gene="p53" 

/ number =4 
intron 12300.. 13054 

/gene="p53" 

/ number =4 
repeat unit 12588.. 12597 

/gene="p53" 

/note="5 ' -ALU flanking" 
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12598 . . 12882 

/gene="p53" 

/ rp t_f ami ly= " ALU " 

12901. .12910 

/gene= n p53" 

/note= n 3' ALU- flanking" 

13055 . . 13238 

/gene= n p53" 

/ number =5 

13239. .13319 

/gene= n p53" 

/number =5 

13320 . . 13432 

/gene="p53" 

/ number =6 

13433 . . 13999 

/gene="p53" 

/number =6 

13617 . . 13630 

/gene="p53" 

/note="5 ' -ALU flanking" 
13631. .13913 
/gene="p53" 
/rpt_f amily= "ALU" 
13930. .13943 
/gene="p53" 

/note="3' ALU-flanking" 

14000 . . 14109 

/gene="p53" 

/number =7 

14110. .14451 

/gene= M p53" 

/ number =7 

14452 . . 14588 

/gene="p53" 

/ number =8 

14589. .14680 

/gene="p53" 

/number =8 

14681. .14754 

/gene="p53" 

/number =9 

14755 . . 17571 

/gene="p53" 

/number =9 

15171. .16491 

/gene="p53" 

/note=" rearranged cluster" 
/ rp t_f ami 1 y = " ALU " 
16633 . . 17397 
/gene="p53" 

/note=" rearranged cluster" 

/rpt_f amily= "ALU" 

17572 . . 17678 

/gene="p53" 

/number=10 

17679. .18598 

/gene="p53 n 

/number=10 

18076. .18091 
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/gene="p53" 

/note= " 5 1 -ALU flanking" 
repeat unit 18092.. 18371 

/gene= ,, p53" 

/ rp t_f ami ly= "ALU " 
repeat unit 18389.. 18404 

/gene^'pBS" 

/note="3' ALU-flanking" 
exon 18599. .19876 

/gene="p53 n 

/number=ll 
repeatunit 19424.. 19431 

/gene="p53" 

/note="3' ALU-flanking" 
repeat unit 19431.. 19752 

/gene= M p53" 

/rpt_family= "ALU" 
repeat unit 19753.. 19760 

/gene="p53" 

/note="5 ■ -ALU flanking" 
5246 a 4970 c 5107 g 4980 t 



BASE COUNT 
ORIGIN 

1 ttcccatcaa 
61 cttggcgaga 
121 caccgccccc 
181 tctccaggct 
241 gctacccagc 
301 tctatatcct 
361 tcagagtgat 
421 agaaaacgtt 
4 81 gagaagaaag 
541 ttgacacaat 
601 cttgcagaat 
661 gactctgcac 
721 gcccttactt 
781 gattggggtt 
841 aa^tc^agag 
901 cgggctggga 
961 actgaacttg 
1021 ggaaaatcgc 
1081 gatggaagct 
1141 gatgatgggg 
1201 gctttgggga 

12 61 gcgtagggaa 
1321 tcgaaacatt 

13 81 ggggctccat 
1441 ccacaacttc 
1501 ctagatggag 
1561 gggtttgccc 
1621 ttgtttggct 
1681 cggagcagct 
1741 gccggttcct 
1801 gattacttag 
1861 cctccccttc 
1921 taccatggcg 
1981 aagaatgagc 
2041 gaaattttga 
2101 ctgttatcta 
2161 cggaaggggc 
2221 gttcttcctg 



gccctagggc 
agcgcctacg 
ccgccgcctg 
tcagacctgt 
actgatatag 
cataaggctt 
aagggttgtg 
a 999tgtgga 
gatccagctg 
gcaggattcc 
tttccacccc 
cctcctcccc 
gtcatggcga 
ttcccctccc 
ccaccgtcca 
gcgtgctttc 
atgagtcctc 
tggggctggg 
tggctagagg 
atgttaggac 
accttgagcc 
tccctgaaat 
gagaactcat 
tccgaaatga 
cggacctttg 
aaaatccaat 
gccaggccga 
ccacatttac 
cactattcac 
cttacttggc 
ttatggcctt 
tttgaatggt 
tagacaggga 
atatctcatc 
aagagtgcaa 
gctttgggct 
gggcccgcag 
gtaggaggcg 



tcctcgtggc 
ctccccctac 
cagagggcgc 
ctccctcatt 
gcactcagga 
acgtttccat 
aaggagatta 
tattacggaa 
agagcaaacg 
tccaaaatga 
aaaatgttag 
aactccattt 
ctgtccagct 
atgtgctcaa 

999 a 9 ca 99t 
cacgacggtg 
tctgagtcac 
ggtggggcag 
gatcatcata 
catccgaact 
ggcctaaagc 
aaaagatgca 
agctgtatat 
tcatttgggg 
tccttcctgg 
tgaaggctgt 
ggaggaccgt 
atttctgcct 
ccgatgagag 

a 9 a ggg a 99 c 

tgcaaaggca 
gtgccccacc 
gggaaagaag 
ttcccggaga 
tgatgggtcg 
aggccattcc 
gcgtccgtgt 
gaactcgaat 



tgctgggagt 
cgagtcccgc 
agcaggtctt 
caaaaaatat 
atacaacaat 
gtactgaaag 
aataagatgg 
agccttccta 
caaaagcttt 
tttccaccaa 
tatctacggc 
cctttgcttc 
ttgtgccagg 
gactggcgct 
agctgctggg 
acacgcttcc 
gggctctcgg 
tggggactta 
ggagttgcat 
caaagttgaa 
gtacttcttt 
caaagcattg 
tttagagccc 
gtgatccggg 
agcgatcttt 
cagtcgtgga 
cgcaatctga 
cttgcagcag 

999 a 99 a 9 a 9 
tgctattctc 
ggggtatttg 
ccccgggtcg 
tgtgcagaag 
aaaaaaaaaa 
tttgataatt 
agttccagac 
ggtcctccgt 
tcatttctcc 



tgtagtctga 

ggtaattctt 

gcacctcttc 

ttattatcga 

gaataagata 

caatgaacaa 

tgtgatataa 

aaaaatgaca 

cttccttcca 

ttctgccctc 

accaggtcgg 

ctccggcagg 

agcctcgcag 

aaaagttttg 

Atccggggac 

ctggattgg^| 

ctccgtgtat 

gcgagtttgg 

tgttgggaga 

cgcctaggca 

gcacatccac 

aggtctgaga 

atggcatcct 

gagcccaagc 

ccaggcagcc 

agtgagaagt 

gaggcccggc 

catttccggt 

agagagaaaa 

cgcctgcatt 

ttttgatgca 

cctgcaacct 

gcaagcccgg 

agaatggtac 

tgtcgggaaa 

gcaggctgaa 

gcagccctcg 

cgctgcccca 



acgcttctat 
aaagcacctg 
tgcatctcat 
gctcttactt 
gtagaaaaat 
ataaatctta 
agtatctggg 
tttaactgat 
cccttcatat 
acagctctgg 
cgagaatcct 
cggattactt 
gggttgatgg 
agcttctcaa 
actttgcgtt 
taagctcctg 
tttcagctcg 
gggtgagtgg 
cctgggtgta 
gaggagtgga 
ccggtgctgg 
cttttggatc 
agtgaaaact 
tgctaaggtc 
cccggctccg 
gctaaaccag 
agccctgtta 
ttctttttgc 
tgtcctttag 
tctttttctg 
aacctcaatc 
aggcggacgc 
aggcactttc 
gtctgagaat 
aacaatctac 
cgtcgtgaag 
gcccgagccg 
tctcttagct 
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2281 cgcggttgtt tcattccgca gtttcttccc 
2341 ccgtacttac gtcatctttt tcctaaatcg 
24 01 acagcaagtg cacaggaaga tgagttttgg 
2461 cacagaccct tttcatcgtc ccagaaacgt 
2521 accccacctt tattttgatc tccataacca 
2 581 ggaatcagga tgggcgctgt ggctcacgcc 
2 641 ggccgaggcg ggcggattac ttgaggatag 
2701 aatccccgtc tctactaaaa aatacaaaaa 
2 761 cagctattcg ggagggtgag gcaggagaat 
2821 gagccaagat cgtgccacta cactccagcc 
2 881 aaaaaggggg gaatcataca ttatgtgctc 
2941 ctgtctgaca ttcgttcatg ttgtatatat 
3001 tccatcgatt gtatatccgt ccttttgatg 
3061 tgaaataaag ctgctataaa cattcttgta 
3121 tttcttggag aaatacttag gaggggaatt 
3181 aactttttca gaagctctga gttttccaga 
3241 aaggtatggg agttattatg gttgtgccac 
3301 tctaatgtgg tatatccttg tggttgtaat 
3361 gcattttttc atgtgcctat tggccattcg 
3421 ttttacctgt tattttggtt tttttgtttg 
3481 tactttctgg atacaagtcc ttatcagatc 
3541 ttgcgggcct atttgtttgc tttacagagt 
3601 ttttttatgt tcagtatttg ccttgtcctg 
3661 ccagggtcat gaagatataa tcttacattt 
3721 acagtaaggt ccttaagcca ttaattaatt 
3781 taggagtcag tctctggtat ctttcctgta 
3841 tgaaataggc ttcctttctc tactgaatgc 
3901 atagggctac cattttagtg ctattttctt 
3961 tcacactgtt gcccaggcta gagtacaatg 
4021 ccctgggctc aagcagtcct ctagcagcct 
4081 cagctaacta ttttattttt ttgtattgac 
4141 caaactgctg gcctcaagct ttcatcccat 
4201 tgtgagccac catgcctgac ctcttagtgc 
4261 tctctttaaa cgttggagga agaaacagta 
4321 aggaacagac tgggcgcggt ggctcatacc 
43 81 aggggatcat ttgaggtcgg gagttcgaga 
4441 tctactaaaa tacaaaagta gctaggcgtg 
4 5 01 ggctgaggca caagaatccc ttgaacctgg 
4561 gccactgcac tccagcctgg gcaacagagt 
4621 aagaaaaaat agaggaatat ttcccaactt 
4681 aaaccaaaca aggacattat aagaaaagaa 
4741 catgcaacag ctaaccaatt ttagcaaacc 
4801 aggccagtcg cggtggctca cgcctgtaat 
4861 atcacttgag gtcaggagtt tgagaccagc 
4 921 taaaaataca aaaatcaggc tgggcacggt 
4 981 aggccgaggt gggcagatca cgaggtcagg 
5041 aacgccatct ctactaaaaa tacgaaaatc 
5101 cagctactca ggaggctgag gcagaattgc 
5161 ccaagatcgt gccactgcac tccagcctgg 
5221 aaaaaaatta gctgggcatg gtggtgggca 
52 81 gcaggagaat cgcttgaacc caggaggcag 
5341 cactccagcc tgggcaacag agtgagactc 
5401 aatacattct aaccaaataa tgtttatctc 
5461 tggcctggtg cagtagctca ggcctgtaat 
5521 atctcttgag cccaggattt caagaccagc 
5581 ggggggaaaa aaatcagtct gtgtaattca 
5641 tatgatcatt tcaacagatg tagcaaaagc 
5701 acaaaccaac caacctccta gcaaactagg 
5761 gcgtccacag tcggagttcc actagcagca 
5821 ggggcggtgg ctcacgcctg taatgccagc 



atgcacctgc cgcgtaccgg ccactttgtg 
aggtggcatt tacacacagc gccagtgcac 
cccctaaccg ctccgtgatg cctaccaagt 
ttcatcacgt ctcttcccag tcgattcccg 
ttttgcctgt tggagaactt catatagaat 
tgcactttgg ctcacgcctg cactttggga 
gagttccaga ccagcgtggc caacgtggtg 
ttagctgggc gtggtgggtg cctgtaatcc 
cgcttgaacc cgggaggcag aggttgcagt 
tgggcgacaa gaacgaaact ccgtctcaaa 
atttttgtcg ggcttctgtc cttcaatgta 
cagtattttg ctccttttca tttagtatag 
gccttttgag ttgtttccca tttgcggtta 
caattctttt tgtgatcata tgttttcgtg 
gtggaggaag taaaaagtag ctgtattttg 
gcggttgtac cattttacac tccaactagc 
agccttccgg acattaggta tgtcagtctt 
ttacagttct ctattgacta aggatgttca 
tattttgttt gtaaagtagc tcttcgagtc 
tttttattgt tcagttgtgg gactgcttta 
catgagtcgt gaatgttttc ttctgatctg 
ttacagaatc ttaagaggag tggattaatc 
tttaggacat cttttttttt ttttttaacc 
tcttttagga cctttatggt ggtaagtttt 
cttaaaatta attgtttatg gtgtgaggtg 
tggaaatcca gttattctgt ctccacttgt 
ttttaatttt aattatttta cagttggagt 
tttttctttg ttaatttttg agacagggac 
gcacaatcaa ggcttactgc agcctcgaac 
cacgagtagc tgggattact ccaccacacc 
aggatctcac tatgttgccc aggctggtct 
ctcggcctcc caaagtgctg ggattacagg 
tattttctat ttatctcctc tgttctctgc 
cccatcttac acaaactctt cagaaaacag 
tgtaatctca gcactttggt acgctgaggc 
ccagcctggc caacacggcg aaaccccatc 
caccatacct gtaatgccag ttactcagga 
gaagcggagg ttgcagtgag ccgagattgc 
gagaccctgt ctcagaaaaa aaaagaaaga 
gttttcgaag ccaggataat cctggtacca 
aatatagacc aatattcctg ttagcataga 
aaacctggta atatagaaaa aaggataaat 
cccagcactt tgggaggctg aggcaggcag 
ctgaccaaca tggtgaaacc ccgtttctaa 
ggctcacgcc tgtaatccca gcactttggg 
agttcaagac cagcctgacc aatgtggtga 
agccggtgtg gtggcacctg cctgtaatcc 
ttgaacccgg gaggcagagg ttgcagtgag 
gcgacagagc aagacttcat ctcaaaaaaa 
cctgaaatcc cagctactcg ggagtctgag 
aagttgcact gagctgggat cacaccattg 
catctcaaaa aaagaaaaag aaaaaggata 
atgattgtag ctgattcaac attcaaaaat 
cccaacattt taggaggctg aggcaggaag 
ctgggcaaca tagtcagact ggtctttact 
ccacattaac aaagggaaac ataaaaaccc 
agttaatgat atcaacacat atgcatgatt 
gaaaggaaac ttaactagtt tgataacagg 
tacataatgg tagaaaactc agtgctgctg 
gctttgggag gcctaggcgg gcggatcacg 
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5881 aggtcaggag atcgagactg tcctgactag catgctgaaa ccccgtctct actaaaaata 
C " mcaM aaa "agccg ggcatggtgg cgggcgccta tagtlccagc Scqgqaq 
606 actaSf 9a9 \ at " C 9 tg-cccggg aggcggagct tgcagagc^t agatcSc 
6061 actgcactcc agcctgggtg acagagtgag acttcgtctc aaaaaaaaaa aaaaaaaaaa 

2 IZllZ" r 09 ^' *>tggcgggc gcctgl.Jtc ccagcuctc gggi«"» 

| a 35SK S55 S£= 22= ££= 
IIS ~= " =S5S =553 

oi cccattaaaa tattaaaaaa atagccaggc gttggccggg tacagtqqct catacctat- a 

ESS S-SS ~ ~ » 

S2£ SSSS ~ g 

c :: a ;s s:r aa,a " a,aa3aat t,9 <^ isittii 
i F< ~ — c « 
ss £ li 1E1 ~ == ~ 

7^1 = * " cca gcact tttaggaggc tgaggcaggt ggcttccctg qqaccaqcta 

7381 alTrTf 9 T aaCCCCa actctaataa aaatacaaaa aatcagccag gSggSq 
4 gcaSttq c'aor 39 " 9 CtCa " a9gc tgaggcagga gaattgcttg gacclgggS 

7SJ1 ccgatca^ S^cS: g ™ JESS gg t ~ < g ™ 
7681 a'"' 9 aatta 9 ct 9 33tgtagtgg tgggcgcctg ?L?cccagc tacttqqqaa 

E ~ = 535S == sS 

786 c?aate«S ™?T aaaaaa 3tcc taatggaaaa tccataaaaa gctaccaaaa 
/86i ctaataaata aatatagcag ggttgcaggt tacagggcaa tatagttatc cctctatcta 
7921 taggggcttg gttctgggac tcctcacaca ccaaacccac agatgtctaa gtcccaSa 

= = =s ::s=; as; 

ss sSs IE i2 ;~ =s 

RR9i 4. ^yyy^y^ tcaagaccag cctgactaac atggtgaaac cccatctrt-a 

llll Co' 9 ! 9 aaaatta 9 cc aggtgtggtg gcacatgctt gtagtJctag ctaScaS 
4 actqL?L Ca "gaatccgg gaggaggttg LgLgcgg agJtagtgcc 

ataaaltiaa SS? 9 " \ taaga 9 ct 9 agactccgtc tcaaaaLaa aLaataaa 
9061 gt a gggg at a tt'Sc^ -IT"" 9 aaaaga9CCt 3-attaaag gagctgtttg 
9121 ctgSaS aacactttoa 9 ° aaaCaaCa 9 aa aagtggg ctgggcacag tggttcatgc 
9181 aa „l ! agcactttgg gaggccaagg cgggcggatc acctgaagtt gggagttcaa 

942 gSa a g aa tc ttaqcSaa'c ST*"" a3Ca9aaaa 9 ^aacaaac acttacagL 
gg cgtttc ttagcaaatc tgatgacaaa tttggcataa agaaagagag catccctgaa 
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94 81 aaaaaaaaaa agaaaaagaa agagagcatc ctgcctgggc aacatagtga aaccctgcct 
9541 ctacaaaaaa actcaaaaat tggccgggtg cagtggctca cacctgtaat cccagcactt 
9601 tgggagtcgg aggcgggagg atcacctgag gtcaggagtt cgaaaccagc ctggccaaca 
9661 tggcaaaacc ccatctctac taaaaataca aaaaattaat caggcgcatt ggtgggcgcc 
9721 tgtaatccca gctactcagg aagttgaggc aagaggatcg cttgatactg ggaggtggag 
97 81 gttacagtga gtcgagatca caccactgca ctctagcctg ggtgacaggg cgagactccg 
9841 tctccaaaaa aaaaaagaaa aagaaaaaga ctaaaaaatt agccaggcag gcctctgtgg 
9901 tcccagctac ttgggaggct gaggcaggag aatcactgag cccaggagtg gcaggctgta 
9961 gtgagccatg attgcaccac tgtaccctag cttgggcttc aaagcaagac cctgcctcaa 
10021 aagaaaaaag aaagaaagaa agaacatggc gggccaggca cagtggctca cacctgtaat 
10081 cccagcgctt tgagaggccg aggcaggtgg atcacaaggt caggagttcc acaccagcct 
10141 ggccaacatg gtgaaaccct gtctctacta aaaatacaaa aaatcagcag gcagggtggt 

102 01 aggggcctgt aatcccagct actcgggagg ctgaggcagg agaattgctt gaaaccagaa 
10261 ggcagaggtt gcagtgagcc tagactgcac cactgcactc cagcctgggc gaaaagagcc 
10321 aaactccatc tcaaaaaaca aacaaaaaaa caaaacaaaa aaaaacatgg cagcctttga 

103 81 aagcttgtct gggagaaggt gcgatgatgg ttgcataact tcgtgcaaga tgctggtcca 
10441 cacaggggct gccccttgct ctttctcgct ctcttaacct ctcatataac aggcttgtgt 
10501 gttatgcaca tttattgagc ccaagcaggt gcaaggcatt gtgatctaat actttggtca 
10561 gcaagacaac aagatagatc actgccctgc ccttaggaag tgtatatgct attagaggaa 
10621 acagataaaa taaacaagga aaagtatcag acaatgtaag tgctatgaga atgcaaatga 
10681 ggtgatgtga attaaaatag gatgacttaa gtctgcacgg aaggccccta cccccatgtt 
10741 cctggctagc caaggaacca ccagttgatt agcagagaag ggcagcccgt ctagctagag 
10801 cttttgggga agagggagtg gttgttaaga gatgagatta aagaagccga gacgggccct 
10861 tcgtgagggg gggttgtaat gcagggctga ggagtgtccg aagagaatgg gcaggtgagc 
10921 ggtgagacag ttgttcttcc agaagctttg cagtgaaagg aatcaaagaa atggagccgt 
10981 gtatcaggtg gggaagggtg ggggccaagg gggtgtcctt ccccatacag agattgcagg 
11041 ctgagaatga ctatatcctt gttaacagga ggtgggagca gggcacggta gctcacacct 
11101 gtaatcttgg cactttagga ggcggaggcg ggccgatcac ctgaagtaag gagttcgaga 
11161 ccagcctggc caacatgcaa agccctgtct ctactaaaaa tacaaaaatt agctgggtgt 
11221 ggtggtactc gcctgtaatc ccagctactc gggagactga ggcaggagaa tggcttgaac 
112 81 ccggaaggta gaggttgcag tgagctgaga tcatgccact gtgctccagc ctaggtgaca 
11341 gagagagact ccatctcaaa aaaaaaaaaa aatacaggaa gggagttggg aatagggtgc 
11401 acatttagga agtcttgggg atttagtggt gggaaggttg gaagtccctc tctgattgtc 
114 61 ttttcctcaa agaagtgca^^ ggctggtgtg gggtggggca ggagtgcttg ggttgtggtg 
11521 aaacattgga agagagaatej tgaagcagcc attcttttcc tgctccacag gaagccgagc 
11581 tgtctcagac actggcaj^g tgttggggga ggggcctcct cctctocasjg cccaggtgac 
11641 ccagggttgg aagcgtctc a Jiq ctggatcc ccacttttcc tcttgc^cfca ^iccagactgc 
11701 ^Ltssaagjtc actqcc atg g agga^ccgc^gtcagatcct agcgtcgagc cccctctga£f 
11761 (§ca£$aaaca ttttcagacc t ajig gaaac Mgtgagtggat ccattggaag ggcaggccac 
11821 caccccgacc ccaaccccag cccccjtagca gagacctgtg ggaagcgaaa attcatggga 
11881 ctgactttct gctcttgtct ttca^g^tc ctgaaaacaa cgttc^cpta aggacaaggg 
11941 ttgggctggg acctggaggg ctggggc^gc tggggggctg aggacctggt cctctgactg 
12001 ctcttttcac ccatctacag^Ecccccttgc cgtcccaagc aatggatgat ttgatgctgt 
12061 ccccggacga tattgaacaa t^gttcactg aagacccagg tccagatgaa gctcccagaa 
12121 tgccagaggc tgctccccgc gtggcccctg caccagcagc tcctacaccg gcggcccctg 
12181 caccagcccc ctcctggccc ctgtcatctt ctgtcccttc ccagaaaacc taccagggca 
12241 gctacggttt ccgtctgggc ttcttgcatt ctgggacagc caagtctgtg acttgcacgol 
12301 tcagttgccc tgaggggctg gcttccatga gacttcaatg cctggccgta tccccctgca 
12361 tttcttttgt ttggaacttt gggattcctc ttcaccctta ggcttcctgt cagtgttttt 
12421 ttatagttta cccacttaat gtgtgatctc tgactcctgt cccaaagttg aatattcccc 
12481 ccttgaattt gggcttttat ccatcccatc acaccctcag catctctcct ggggatgcag 
12541 aacttttctt tttcttcatc cacgtgtatt ccttggcttt tgaaaataag ctcctgacca 
12601 ggcttggtgg ctcacacctg caatcccagc actctcaaag aggccaaggc aggcagatca 
12661 cctgagcccc aggagttcaa gaccagcctg ggtaacatga tgaaacctcg tctctacaaa 
12721 aaaatacaaa aaattagcca ggcatggtgg tgcacaccta tagtcccagc cactcaggag 
12 781 gctgaggtgg gaagatcact tgaggccagg agatggaggc tgcagtgagc tgtgatcaca 
12 841 ccactgtgct ccagcctgag tgacagagca agaccctatc tcaaaaaaaa aaaaaaagaa 
12901 aagctcctga ggtgtagacg ccaactctct ctagctcgct agtgggttgc aggaggtgct 
12961 tacacatgtt tgtttctttg ctgccgtgtt ccagttgctt tatctgttca cttgtgccct 
13021 gactttcaac tctgtctcct tcctcttcct acaglactcc cctgccctca acaagatgtt 

*3 



http://wv^.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retri 1/28/03 



NCBI Sequence Viewer 



Page 9 of 1 1 



13081 ttgccaactg gccaagacct gccctgtgca 
13141 cacccgcgtc cgcgccatgg ccatctacaa 
13201 gcgctgcccc caccatgagc gctgctcaga 
13261 acgacagggc tggttgccca gggtccccag 
13321 tctggcccct cctcagcatc ttatccgagt 
13381 tgacagaaac acttttcgac atagtgtggt 
13441 tgcaactggg gtctctggga ggaggggtta 
13501 cagtaggggg gctttctcct gctgcttatt 
13561 aagtaaatgg gtttaactat tgcacagttg 
13621 tcccctgctt ggctgggcgc agtggctcat 
13 6 81 ggcaggcgga tcacgaggtt gggagatcga 
13741 tctactgaaa aatacaaaaa aaaattagcc 
13 801 ctactcggga ggctgaggaa ggagaatggc 
13861 ctgagatcac gccactgcac tccagcctgg 
13921 aaaaaaaaag gcctcccctg cttgccacag 

13 981 ggcctgtgtt atctcctagg ttggctctga 
14041 taacagttcc tgcatgggcg gcatgaaccg 
14101 agactccagg tcaggagcca cttgccaccc 
14161 gcttgccgct gacccctggg cccacctctt 
14221 cctctcatca catttccggc gggaatctcc 
14281 ggctttggga cctcttaacc tgtggcttct 
14341 tccagaaagg acaagggtgg ttgggagtag 
14401 aggacctgat ttccttactg cctcttgctt 
14461 actgggacgg aacagctttg aggtgcgtgt 

14 521 agaggaagag aatctccgca agaaagggga 
14 5 81 taagcgaggt aagcaagcag gacaagaagc 
14641 tcagattcac ttttatcacc tttccttgcc 
14701 tcctctcccc agccaaagaa gaaaccactg 
14761 agtcttggga cctcttatca agtggaaagt 
14821 ttcttgactg ttttacctgc aattggggca 
14881 gacaatggct cctggttgta gctaactaac 
14941 attttaaagg accagaccag ctttcaaaaa 
15001 tctatgactt tgcctgatac agatgctact 
15061 cgtcgtaagt tgaaatattg aaatattgta 
15121 aaggaacatc atagcttagc ctagcctgct 
15181 tctcactctg ctacccaggc tggagtgcag 
15241 ccttctgggt tcaagcgatt ctcctgcctc 
15301 ctgccccgac gcccagctaa ttttttgtta 
15361 atttcaccat gttggccagg ctagtctcga 
15421 cctcccaaag tgctgggatt acaggcgtga 
15481 ttattttatt tttaatggag acagcatctt 
15541 gtgatcatag ctcattatac cctcctgggc 
15601 gtagctagga ccacaggcat acaccaccat 
15661 tagatagagt ctcactatgt tgcccaggct 
15721 cttgctctgt cacccaggct ggatagagtg 
15781 ccacctccca ggctccatcc atcctcccag 
15841 cctgcaccac catgcctggc taatttttat 
15901 ttttgagact cagtctcact ctgtcgcctt 
15961 cactgctaac ctctgcctcc tgggtttcaa 
16021 ctaggactac aagcgcctgc taccacgccc 
16081 ggtttcacca tgttggccag gctggtctcg 
16141 ctcccaaagt gctgggatta caggtgtgag 
16201 atttaaagac agagtctcac tctgtcactc 
16261 cactgcagcc ttgacctccc tgggctccgg 
16321 caggcacatg ccacgacacc cagctaattt 
16381 cgttgcccat gctggtatca aacccctggg 
16441 aagtattggg gttacaggca tgagctacca 
16501 cagagcattt aagttaccct acagttgggc 
16561 gtaataaaat gttgtatatc tcatgtgatt 
16621 agcatggttg catgaaagga ggcacagtcg 



gctgtgggtt gattccacac ccccgcccgg 
gcagtcacag cacatgacgg aggttgtgag 
tagcgat^jt gagcagctgg ggctggagag 
gcctctgatt cctcactgat tgctcttagg 
ggaaggaaat ttgcgtgtgg agtatttgga 
ggtgccctat gagccgcctg aggtctggtt 
agggtggttg tcagtggccc tccgggtgag 
tgacctccct ataaccccat gagatgtgca 
aaaaaactga agcttacgag gctaagggcc 
gcctgtaatc ccagcacttt gggaggccaa 
gaccatcctg gctaacggtg aaaccccgtc 
ggsegtgStS ctgggcacct gtagtcccag 
gtgaacctgg gcggtggagc ttgcagtgag 
gcgacagagc gagattccat ctcaaaaaaa 
gtctccccaa ggcgcactgg cctcatcttg 
ctgtaccacc atccactaca actacatgtg 
gaggcccatc ctcaccatca tcacactgga 
tgcacactgg cctgctgtgc cccagcctct 
accgatttct tccatactac tacccatcca 
ttactgctcc cactcagttt ccttttctct 
cctcccacct cctggagctg gagcttaggc 
atggagcctg gttttttaaa tgggacaggt 
ctcttttcct atcctgagta gtggtaatct 
ttgtgcctgt cctgggagag accggcgcac 
gcctcaccac gagctgcccc cagggagcac 
ggtggaggag accaagggtg cagttatgcc 
tctttcctag cactgcccaa caacaccagc 
gatggagaat atttcaccct tcaggtacta 
ttccagtcta acactcaaaa tgccgttttc 
tttgccatca gggggcagtg atgcctcaaa 
ttcagaacac caacttatac cataatatat 
gaaaatagtt aaagagagca tgaaaatggt 
tgacttacga tggagttact tctgataact 
agttgaaaat ggatttaata cacctaatct 
tttttttttt tttttttttt ggagacagag 
tggcgggatc tcggctcact gcaacctccg 
agcccactga gtagctggga ttacaggcac 
tttatttcct ttttttttag tagagataga 
actcctgacc ttgtgatctg cctgccttgg 
gccaccgcac ctggcctgcc tagcctactt 
gctctgttgc ccaggctgga ttacagtgat 
tcaagcaatc cccctaactc tgcctcccca 
acccagctaa tttttaaaat tttttgtaga 
ggtctctagc ctactttttt gagacaaggt 
cagtagtgca gtcacagctc actgcagcct 
ctcagcctcc caagttgctt caactacagg 
ttatttattt ttattttatt ttattttatt 
aggctggagt gcagtggcat gatctcggct 
gtgattctcc tgcctcagcc tcccgaatag 
ggctaatttg tgtattttta gtagagacag 
aacttctgac catgtgatcg ccgcctcggc 
ccaccacgcc cggctaattt ttatttattt 
aggctagagt gcagtggcac catctcagct 
tgatttcacc ctcccaagta gctaggacta 
tttattttct gtgaagtcaa ggtcttgcta 
ctcaatcaat ccttccacct cagcctcccc 
cactcagccc tagcctactt gaaacgtgtt 
aaagtcatct aacacaaagc cctttttata 
tattagatat tgttactaaa agtgagaaac 
aagccaggca cagcctgggc gcagagcgag 
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16681 actcaaaaaa agaaaaggcc aggcgcactc 
16741 ctgaggcggg tggatcacct gaggtcagga 
16801 accccgtctc tactaaaata caaaaattaa 
16861 agctacttgg gaggctgagg caggagaatc 
16921 gccaagacgg cgccactgca ctccagcctg 
16981 aaaagaaaag aaacgaggca cagtcgcatg 
17041 aggcaggagg atctcttgag cccaagagtt 
17101 tcatctctaa aatttaaaaa agggccgggc 
17161 ttgggaggtg gaggtgggta gatcacctga 
17221 atggtgaagc cccatctcta ctaaaaacac 
172 81 ccacgtactc ggaggctgag gcacaagaat 
17341 cagccaagat tgcaccattg cactcccgcc 
17401 ataaataaat aaatattttt aaaagtcagc 
17461 aaatcgatgt tgcttttgat ccgtcataaa 
17521 aactcaggta ctgtgaatat acttacttct 
17581 cgtgagcgct tcgagatgtt ccgagagctg 
17641 gctgggaagg agccaggggg gagcagggct 
17701 tggccctact cccctgcctt cctaggttgg 
17761 ttcatggtca aaggcagctg accccatctc 
17821 ttagtcttcc tccggttgaa tcctataacc 
17881 acatccaaac ccagggacga gtgtggatac 
17 941 ccagagctgg agggtctcaa ggggcctaat 
18001 tcaggtcata tactcagccc tgccatgcac 
18061 cagtttcctt atatataaaa tggggtaagg 
18121 actctgggag gccaaggcga gtggatcacc 
18181 aacatggaga aaccccatct ctactaaaaa 
18241 cctgtaatcc cagctaccta ctcgggaggc 
18301 gcagaggttg cggtgagctg agatctcacc 
18361 aactccgtct caaaaaagta taataaagta 
18421 tacacactaa tactctgagg tgctcagtaa 
18481 tgatttgaat tcccgttgtc ccagccttag 
18541 gcacagaccc tctcactcat gtgatgtcat 
18601 acctgaagtc caaaaagggt cagtctacct 
18661 aagggcctga ctcagactga cattctccac 
18721 ccatctctcc ctcccctgcc attttgggtt 
18781 tgtgcgtcag aagcacccag gacttccatt 
18841 tggcctgcac tggtgttttg ttgtggggag 
18901 attttaaggt ttttactgtg agggatgttt 
18961 gggttagttt acaatcagcc acattctagg 
19021 agggaagctg tccctcatgt tgaattttct 
19081 tggcatttgc acctacctca cagagtgcat 
19141 gccttgaaac caccttttat tacatggggt 
19201 tccctctccc tctccctgtt ggctggtggg 
19261 aggtagaggg agttgtcaag tcttgctggc 
19321 tcgaccttag tacctaaaag gaaatctcac 
19381 ttgtatatga tgatctggat ccaccaagac 
19441 tctttttttt tttttttttt ctttttcttt 
19501 ggagtggagt ggcgtgatct tggcttactg 
19561 tgcctcagcc tccggagtag ctgggaccac 
19621 tgcatgtttt gtagagatgg ggtctcacag 
19681 ctcaggcgat ccacctgtct cagcctccca 
19741 cgtggagctg gaagggtcaa catcttttac 
19801 acccttcccc tccttctccc tttttatatc 
19861 taaaactttg ctgccacctg tgtgtctgag 
19921 agcaggtgca ggggtgagtg aggaggtgct 
19981 gtggactggg gggcccagtg cccgggttcc 
2 0041 agtctgcggg ctgcatgaca acaagggagg 
20101 ccgtccctcc tcccctccgc ccacggctgg 
20161 gattgggctg cttccccctc ggcagcctct 
20221 atttacgaag gctgagcctc cagggtggcc 



tcacgcctgt aatcccagca tttcgggagg 
gttcaagacc agcctagcca acatggtgaa 
ccgggcgtga tggcaggtgc ctgtaatccc 
gcttgaacca ggaggcggag gttgcaggga 
ggcgatagag tgagactccg tctcagaaaa 
cacatgtagt cccagttact tgagaggcta 
tgagtccagc ctgaacaaca tagcaagaca 
acagtggctc acacctgtaa tcccagcact 
cgtcaggagt tggaaaccag cctggctaac 
aaaaattagc cagtgtgaga cacgttgagt 
cacttgaacc ccagaggcgg agattcgaat 
tgggcgacga gagtgagacc ccatctcaaa 
tgtataggta cttgaagtgc agtttctact 
gtcaaacaat tgtaacttga accatctttt 
ccccctcctc tgttgctgca gatccgtggg 
aatgaggcct tggaactcaa ggatgcccag 
cactccaggt gagtgacctc agccccttac 
aaagccatag gattccattc tcatcctgcc 
attgggtccc agccctgcac agacattttt 
acattcttgc ctccacgtag tatccacaga 
ttctttgcca ttctccgcca actccccagc 
aattgtgtaa tactgaatac agccagagtt 
cggcaggtcc taggtgaccc ccgtcaaact 
gggccgggcg cagtggctca cgaatcccac 
tgaggtcggg agtttgagcc cagcctgacc 
tacaaaagta gccgggcgtg gtgatgcatg 
tgaggcagga gaatcgcttg aacccgggag 
attacactcc agcctgggca acaagagtga 
aaatggggta agggaagatt acgagactaa 
acatatttgc atggggtgtg gccaccatct 
gcccttcaaa gcattggtca gggaaaaggg 
ctctcctccc tgcttctgtc tcctacagcc 
cccgccataa aaaactcatg ttcaagacag 
ttcttgttcc ccactgacag cctccctccc 
ttgggtcttt gaacccttgc ttgcaatagg 
tgctttgtcc cggggctcca ctgaacaagt 
gaggatgggg agtaggacat accagcttag 
gggagatgta agaaatgttc ttgcagttaa 
taggtagggg cccacttcac cgtactaacc 
ctaacttcaa ggcccatatc tgtgaaatgc 
tgtgagggtt aatgaaataa tgtacatctg 
ctaaaacttg acccccttga gggtgcctgt 
ttggtagttt ctacagttgg gcagctggtt 
ccagccaaac cctgtctgac aacctcttgg 
cccatcccac accctggagg atttcatctc 
ttgttttatg ctcagggtca atttcttttt 
gagactgggt ctcgctttgt tgcccaggct 
cagcctttgc ctccccggct cgagcagtcc 
aggttcatgc caccatggcc agccaacttt 
tgttgcccag gctggtctca aactcctggg 
gagtgctggg attacaattg tgagccacca 
attctgcaag cacatctgca ttttcacccc 
ccatttttat atcgatctct tattttacaa 
gggtgaacgc cagtgcaggc tactggggtc 
gggaagcagc cacctgagtc tgcaatgagt 
gggaggggaa caaaggctgg agactgggtc 
gggtggctcc attcataact caggaaccaa 
cacaaggttc tctgcctccc ctgcttctag 
caccaaggat tacgggattt aaatgtcgtg 
atcttcgtcc atcagaagtg gcaggatacc 
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202 81 tgggttccaa gggaacaggg tgg 

// 
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